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Using analytical methods and Monte Carlo simulations, 
we analyze new statistics designed to detect isolated step-like 
discontinuities which are coherent over large areas of Cos- 
mic Microwave Background (CMB) pixel maps. Such coher- 
ent temperature discontinuities are predicted by the Kaiser- 
Stebbms effect to form due to long cosmic strings present in 
our present horizon. The background of the coherent step-like 
seed is assumed to be a scale invariant Gaussian random field 
which could have been produced by a superposition of seeds 
on smaller scales and/or by inflationary quantum fluctuations. 
We find that the proposed statistics can detect the presense of 
a coherent discontinuity at a sensitivity level almost an order 
of magnitude better compared to more conventional statistics 
like the skewness or the kurtosis. 



I. INTRODUCTION 



The major progress achieved during the past 15 years 
in both theory and cosmological observations has turned 
the search for the origin of cosmic structure into one of 
the most exciting fields of scientific research. Despite the 
severe constraints imposed by detailed observational data 
on theories for structure formation the central question 
remains open: What is the origin of primordial fluctua- 
tions that gave rise to structure in the universe? Two 
classes of theories attempting to answer this question 
have emerged during the past twenty years and have 
managed to survive through the observational constraints 
with only minor adjustments. 

According to the first class, primordial fluctuations 
are produced by quantum fluctuations of a linearly cou- 
pled scalar field during a period of inflation jj]]. These 
fluctuations are subsequently expected to become classi- 
cal and provide the progenitors of structure in the uni- 
verse. Because of the extremely small linear coupling of 
the scalar field, needed to preserve the observed large 
scale homogeneity, the inflationary perturbations are ex- 
pected by the central limit theorem, to obey Gaussian 
statistics. This is not the case for the second class of 
theories. 

According to the second class of theories 0], primor- 
dial perturbations are provided by seeds of trapped en- 
ergy density produced during symmetry breaking phase 
transitions in the early universe. Such symmetry break- 



ing is predicted by Grand Unified Theories (GUT's) to 
occur at early times as the universe cools and expands. 
The geometry of the produced seeds, known as topologi- 
cal defects is determined by the topology of the vaccuum 
manifold of the physically realized GUT. Thus the defects 
may be pointlike (monopoles), linelike (cosmic strings), 
planar (domain walls) or collapsing pointlike (textures). 

The cosmic string theory [pj for structure formation 
is the oldest and (together with textures Q) best studied 
theory of the topological defect class. By fixing its single 
free parameter Gfi (/i is the effective mass per unit length 
of the wiggly string G is Newtons constant and we have 
used units with c = 1) to a value consistent with micro- 
physical requirements coming from GUT's (G/x ~ 10 -6 ), 
the theory is consistent with the noise in pulsar signal ar- 
rival times assuming that the noise is due to gravitational 
radiation emitted by the defect network ||. It may au- 
tomatically account for large scale filaments and sheets 
H , galaxy formation at epochs z ~ 2 — 3 and galac- 
tic magnetic fields Jp) . It can also provide large scale 
peculiar velocities |9| and is consistent with the ampli- 
tude, spectral index jio) and the statistics of the 
cosmic microwave background (CMB) anisotropies mea- 
sured by the COBE collaboration on large angular 
scales (9 ~ 10°). Other planned CMB experiments |l3} | 
of equally high quality but on smaller angular scales are 
expected to provide a wealth of information within the 
next few years. 

The CMB observations provide a valuable direct 
probe for identifying signatures of cosmic strings. The 
main mechanism by which strings can produce CMB fluc- 
tuations on angular scales larger than 1-2 degrees has 
been well studied both analytically ifbl and using numer- 
ical simulations |fjof and is known as the Kaiser- Stebbins 
effect fl5| . According to this effect, moving long strings 
present between the time of recombination t rec and the 
present time to, produce step- like temperature disconti- 
nuities between photons that reach the observer through 
opposite sides of the string. These discontinuities are due 
to the peculiar nature of the spacetime around a long 
string which even though is locally flat, globally has the 
geometry of a cone with deficit angle 8ttG)i. The mag- 
nitude of the discontinuity is proportional to the deficit 
angle, to the string velocity v s and depends on the rela- 
tive orientation between the unit vector along the string 
s and the unit photon wave- vector k. It is given by jl4| 



— = ±4nG^v s j s k ■ (v s x s) (1) 

where j s is the relativistic Lorentz factor and the sign 
changes when the string is crossed. The angular scale 
over which this discontinuity persists is given by the ra- 
dius of curvature of the string which according to simu- 
lations is approximately equal to the horizon scale. 
The growth of the horizon from t rec to to results in a 
superposition of a large number of step-like temperature 
seeds of all sizes starting from about 2° (the angular size 
of the horizon at t rec ) to about 180° (the present horizon 
scale). By the central limit theorem this large number of 
superposed seeds results in a pattern of fluctuations that 
obeys Gaussian statistics. Thus the probability distri- 
bution for the temperature of each pixel of a CMB map 
with resolution larger than about 1° — 2° is a Gaussian 
. It has therefore been considered to be impossible to 
distinguish structure formation models based on cosmic 
strings from corresponding models based on inflation, us- 
ing CMB maps with resolution angle larger than 1° — 2°. 
Theoretical studies have therefore focused on identify- 
ing the statistical signatures of cosmic strings on angular 
scales less than 1° where the number of superposed 
seeds is smaller and therefore the non-Gaussian character 
of fluctuations is expected to be stronger [] 

These efforts however have been faced with the com- 
plicated and model dependent physical processes occur- 
ing on small angular scales. Such effects include isolated 
foreground point sources, recombination physics, string 
properties on small scales (kinks, loops etc) which require 
detailed simulations of both the string network and the 
cosmic background, in order to be properly taken into 
account. Even though there are preliminary efforts for 
such detailed simulations jl7| , it has become clear that it 
will take some time before theory and experiments on an- 
gular scales less than a few arcmin reach accuracy levels 
leading to detectable non-Gaussian string signatures. 

An alternative approach to the problem is instead of 
focusing on small scales where the number of superposed 
seeds is small, to focus on larger angular scales where de- 
spite the large number of superposed seeds there is also 
coherence of induced fluctuations on large angular scales. 
Fluctuations on these scales may be viewed as a super- 
position of a Gaussian scale invariant background com- 
ing mainly from small scale seeds plus a small number of 
step-like discontinuities which are coherent and persist on 
angular scales of order 100°. These are produced by long 
strings present in our present horizon. Our goal is to find 



1 The non-Gaussian features for texture maps are stronger 
than thosed of cosmic strings mainly because of the generi- 
cally smaller number of textures per horizon volume |18|. 



a statistic optimized to detect this large scale coherence 
and use it to find the minimum amplitude of a coherent 
discontinuity that can be detected at the la — 2a level 
relatively to a given scale invariant or noise dominated 
Gaussian background. Such a statistic is equally effec- 
tive on any angular resolution scale and its effectiveness 
is only diminished as the number of pixels of the CMB 
map is reduced or the noise is increased. The statistical 
variable we focus on, in sections 2 and 3 is the Sample 
Mean Difference (SMD) of temperatures between large 
neighbouring sectors of a CMB map. These sectors are 
separated by a random straight line in two dimensional 
maps or by a random point in one dimensional maps. 
The union of the two sectors gives back the complete 
map. We show that the statistics of the SMD variable 
are much more sensitive in detecting the presence of a 
coherent step-like seed than conventional statistics like 
the skewness or the kurtosis. 

We also discuss an alternative statistic, the Maximum 
Sample Difference that is more sensitive in certain cases 
but less robust than the SMD. This statistic is based 
on finding the maximum from a large sample of temper- 
ature differences between large neighbouring sectors of 
CMB maps. We show that for noise dominated data the 
MSD statistic is even more sensitive than the SMD statis- 
tic. However, this sensitivity gets rapidly reduced when 
significant correlations are introduced in the underlying 
Gaussian data. Thus the MSD statistic is more sensitive 
but less robust compared to the SMD statistic. 

The structure of this paper is the following: In the 
next section titled 'Sample Mean Difference' we study 
analytically the statistics of the SMD variable and show 
that its average value is a sensitive quantity in detecting 
the presence of a randomly positioned step-function on 
top of a Gaussian map. We then compare with the sen- 
sitivity of the statistics skewness and kurtosis. We find 
that the sensitivity of the SMD statistic is significantly 
superior to that of skewness and kurtosis in detecting the 
step function. These analytical results are shown for the 
case of one-dimensional maps but the extension to the 
case of two dimensional maps is straightforward. 

In the third section titled Monte Carlo Simulations' 
we perform Monte Carlo simulations of Gaussian maps 
with and without step-like discontinuities in one and two 
dimensions. Applying the statistics skewness, kurtosis 
and average of SMD on these maps we verify the ana- 
lytical results of section 2 and find the minimum step- 
function amplitude that is detectable by the average 
SMD statistic. In section 4 we show both analytically 
and numerically that the MSD statistic can be signifi- 
cantly more sensitive and accurate even compared to the 
SMD statistic when applied to noise dominated data. 

Finally is section 5 we conclude, summarise and dis- 
cuss the prospect of applying the average of SMD statistic 



and the MSD statistic to presently available CMB maps 
including the COBE data. That analysis is currently in 
progress [|9| , and will be presented separately. 



II. SAMPLE MEAN DIFFERENCE 



Consider an one dimensional array of n pixel variables 
x n . Let these variables be initially distributed accord- 
ing to a standardised Gaussian probability distribution. 
Consider now a step-function of amplitude 2a superposed 
so that the discontinuity is between pixels io and io + 1 
(Fig. 1). The new probability distribution for a random 
pixel variable x is 



P(x) 



f 



1 



/2tt 



'2tt 



f c«+° 
-e ~ 



(2) 



where / = We are looking for a statistic that will 
optimally distinguish between a Gaussian array with a 
superposed step-function and a Gaussian array without 
one. The obvious statistics to try first are the moments 
of the distribution (2) with a = and a ^ 0. | 



H(a, f) = <X>=af- a(l - /) 
a 2 (a, f) = < {x — \x) 2 >= 1 + 4a 2 /(l - /) 
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_ 3 + 8a 2 /(3 + 2a 2 - 3f 2 - 8a 2 f + 12a 2 f 2 - 6a 2 / 3 ) 
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For a = we obtain the Gaussian values for the skew- 
ness and the kurtosis s(0, /) = , fc(0, /) = 3 as ex- 
pected. For a the moments deviate from the Gaus- 
sian values. In order to find the minimum value of a 
for which the moments can distinguish between a Gaus- 
sian pattern and a Gaussian+Step pattern we must com- 
pare the deviation of moments from their Gaussian values 
with the standard deviation of the sample moments. The 
mean values of the skewness and the kurtosis are easily 
obtained by integrating with respect to / from to 1 i.e. 
assuming that it is equally probable for the step-function 
to be superposed at any point of the lattice. 



s(a) = < s(a, /) >= / dfs{a, /) = 



(4) 



*(<*) = < *;(«,/)>= / dfk(a,f) (5) 
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These values are to be compared with the standard de- 
viations of the moments, obtained as follows: The vari- 
ance of the skewness over several n-pixel array realiza- 
tions with fixed / and a is 



As 2 (a,/) =< (s- S ) 2 > 



(G) 



Figure 1: A large scale coherent step- function discon- 
tinuity superposed on a one dimensional pixel map. 



where s = s ^+--+ s " [ s the sample skewness from a given 
pixel array realization, s is the actual skewness (for n — > 



The moment generating function corresponding to (2) 



n < si > 

< S >= =< Sl >= s 



(7) 



M(t) = fe at+ ~ + (1 - f)e 



(3) 



The mean /i(a,/), variance a 2 {a,f), skewness s{a, /) 
and kurtosis k(a, /) can be obtained in a straightforward 
way by proper differentiation of M(t) as follows: 



Also 



< s A >- 



<S 2 > +(1 - -) < 8j > 2 



where j any pixel number (j G [1, n]). Thus 
As 2 (aJ) = -(< S 2 >-< S] > 2 ) 



(8) 



(9) 



2 With no loss of generality we may assume a positive or 
because from the statistical point of view there is a «-> — a 
symmetry. 



Similarly for the variance of the sample kurtosis we have 
f 



Ak 2 (a, /) = -(< k 2 > - < k > 2 ) 



(10) 



with kj = -pc(xj — fi) 4 and < >= < (xj — /i) 8 >. 
It is straightforward to obtain all the above moments by 
differentiating the generating functional and using 



d n M, 
< xl l >= — — 
3 dt n 



\t=Q 



(11) 



Now the minimum value a m i n of a detectable at ltr level 
is obtained from the equations 



df[s(a m in, f) - As(a mm ,f)] = (12) 



df[{k(a min , /) - 3) - Ak(a min , /)] = (13) 



Since (from eq. (4)) s(a) = which is equal to the 
Gaussian value, the skewness can only be used to detect a 
step function by comparing the standard deviation As for 
a = and « ^ 0. By demanding As{a rn i n ) < 2As(q = 
0) we obtain a m in < 2.5. This result is independent of 
the number of pixels n. For the kurtosis we obtain from 
eqs. (10, 13) a m i n ~ 4 for n = 10 3 while for a m i n = 0.5, 
n ~ 10 6 is required. 

Using the alternative test i.e. demanding Afc(a m m) > 
2Afc(a = 0) we obtain a > 2 and this result is indepen- 
dent of the number of pixels n as in the case of skewness. 
Thus for the usual pixel maps where n is up to 0(1000) 
the kurtosis is not able to detect a step function with 
\a\ < 2 at the la level. ^ This result remains unchanged 
for other statistical variables defined by local linear com- 
binations of pixels (e.g. differences of neighbouring pixel 
variables [^l)) since the effect of a single discontinuity re- 
mains negligible if the long range coherence is not taken 
into account. 



For CMB temperature maps with (4^) rms 
the detectable value of G/i is 

a = 47rG/x(u s 7 s ) cos6> > 4 x 10~ 5 
=> G//(« s 7 s )cos(9~ 4 x 10~ 6 



2x 10 



-5 



(14) 



where 9 is an angle obtained from the relative orientation 
of the string with respect to the observer. According to 
simulations < v s ^ s >rms— 0.2 and for G/i < 2 x 10~ 5 
the detection of the Kaiser-Stebbins effect using statistics 
based on skewness and kurtosis is not possible. This 
excluded range however includes all the cosmologically 
interesting values of G/i. 

It is therefore important to look for alternative sta- 
tistical variables that are more sensitive in detecting the 



presence of coherent discontinuities superposed on Gaus- 
sian maps. It will be shown that the Sample Mean Dif- 
ference (SMD) is such a statistical variable. A similar 
and even more effective statistic, the Maximum Sample 
Difference will be discussed in section IV. 

Consider a pixel array (Fig. 1) of n pixel Gaussian 
random variables Xj with a step function covering the 
whole array, superposed such that the discontinuity is 
located just after pixel io- To every pixel k of the array 
we may associate the random variable defined as the 
difference between the mean value of the pixels 1 through 
k minus the mean value of the pixels k + 1 through n. It 
is straightforward to show that 



Y k = AX k + 2a 
Y k = AX k + 2a 



n - ip 
n — k 
io 
k 



k e [l,i ] 
k e [io, n—1] 



(15) 
(16) 



where AX k 



J= k+i X r Thus wc 



1 v-vfc y 1 \r^n 

l/c — k Z^j = l ^-j n -k l^r- 

have constructed a new array Y k , (fc = 1, n — 1) from 
the sample mean differences (SMD) of the original array. 
We will focus on the average value Z of the SMD defined 
as: 



Z = 



1 



n-l 



1 ^ 

k=l 



(17) 



Using eqs. (15-17) we obtain 

n— 1 io ■ i n — 1 . , 

^=^7[E^ + 2a(£l^ + £ 

n—1 f— ' f— ' 1 — k/n ^— ' k/n 

fe=l fe=i ' fc=i +l ' 



(18) 



With the definitions / = io/n and £ = k/n and the 
assumption n >> 1 wc obtain: 

-. n—1 

Z = r V AX k - 2a[(l - f) ln(l - /) + / In /] 

n—1 f-^ 



fc=i 



(19) 



Thus the mean of Z over many realizations of the array 
is 

i n—i pi 
<Z>= V < AX k > -4a[ / dff\nf]=a 

(20) 



3 As in all cases discussed in this paper a is measured in units 
of standard deviation (rms) of the underlying Gaussian map. 



The variance of Z is due both to the underlying Gaussian 
map and to the variation of / = io/n (assuming a fixed). 
The variance due to the gaussian background is 



'1,Z 



(n 



i yl.i i r 1 " 6 





(21) 



where e = £ = fc/n, n >> 1 and we have used the 

fact that the variance of the sample mean of a standard- 
ized Gaussian population with size j is 4. Now from eq. 
(21) we obtain 



will be scale invariant and thus there will be correla- 
tions among the pixels. These correlations will also be 
affected by the instrument noise. In addition, our anal- 
ysis has been limited so far to one dimensional maps 
while most CMB experiments are now obtaining two- 
dimensional maps. In order to take all these effects into 
account we need to apply the statistics of the SMD vari- 
able onto maps constructed by Monte Carlo simulations. 
This is the focus of the following section. 
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o~ 1 z ~ —erne 
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(22) 



III. MONTE-CARLO SIMULATIONS 



The variance of the /-dependent part of Z is 



°iz =< zl>-< z 2 > 2 



(23) 



where Z 2 = -2a[(l- /) ln(l -/) + / In/]. From eq. (20) 
we have < Z 2 >— a and < Zf > is easily obtained as 



<Zl>= J df Zl{f)~^o? 



Thus 



,2 _ J2 



9 2 In n 1 , 
a 2 



a z = a \,z + a 2,z - — r - 



(24) 



(25) 



In order to be able to distinguish between a Gaus- 
sian+Step map and a purely Gaussian one, at the ma 
level we demand that 



We start by constructing an array of n Gaussian ran- 
dom variables Xj , j = 1 , . . . , n with a power spectrum 
P(k) = k~ m . Thus the values Xj associated with the 
pixel j are obtained as the Fourier transform of a func- 
tion g(k) (k = 1, n) with the following properties: 

• For each k, the amplitude |<?(fc)| is an independent 
Gaussian random variable with mean and vari- 
ance P(k) = l/k m . 

• The phase 9k of each Fourier component g(k) is an 
independednt random variable in the range [0, 2n] 
with uniform probability distribution P(9k) = 

• The Fourier components are related by complex 
conjugation relations neeeded to give a real vari- 
able Xi. 



\Z a -£ - Z a=0 \ > m a\z 



(26) 



where we have used the variance o\z of a given realiza- 
tion. This implies that the minimum value of a, ct m i n 
that can be detected using this test is 



2 Inn 1/2 

ml Y /z 

n 



(27) 



0.2 which for 



and for n = O(10 3 ) we obtain a m i n - 
m = 1 is about an order of magnitude improvement over 
the corresponding sensitivity of tests based on the mo- 
ments skewness and kurtosis. The reason for this signif- 
icant improvement is the fact that the SMD statistical 
variable picks up the coherence properties introduced by 
the step function on the Gaussian map. The moments on 
the other hand pick up only local properties of the pix- 
els and do not amplify the long range coherence of the 
step-like discontinuity. 

Our analysis so far has assumed that the Gaussian 
variables Xj are independent and that the only correla- 
tion is introduced by the superposed step-function. In 
a realistic setup however the underlying Gaussian map 



The discrete Fourier transform definition used is 



i n 



(28) 



and the numerical programming was implemented using 
Mathematica |2(J. The array Xj obtained in the way 
described above is then standardized to the array X?, 
with 



X 



_ ^ - M ) 



(29) 



where fj, and a 2 are the sample mean and sample variance 
for the realization of the array Xj. A new array X'- is 
then constructed by superposing to the array X? a step 
function of amplitude 2a with discontinuity at a random 
point io- The array Xj is thus obtained as 



X S ; 



' i j — *o r 



j = i, . 



(30) 



Next we apply the statistics discussed in the previous 
section to several realizations of the arrays Xj and X' 



in an effort to find the most sensitive statistic that can 
distinguish among them. Our goal is to also find the 
minimum value of a that can be distinguished by that 
statistic at the la level, thus testing the analytical results 
of the previous section. 

We have used a lattice with 2000 pixels and a scale 
invariant power spectrum which for one-dimcsional data 
is P(k) — k^ 1 . In Table 1 we show the results for the 
skewness, the kurtosis and the average SMD for the Xj 
arrays, with a =0, 0.25, 0.50 and 1.0. The SMD average 
was obtained as in section 2 by first constructing the 
array of sample mean differences and then obtaining its 
average value, predicted to be equal to a by the analytical 
study of section 2. 

These statistics were applied to 50 random realiza- 
tions of the array Xj. The mean values of the statistics 
considered with their la standard deviations obtained 
over these 50 realizations are shown in the following Ta- 
ble 1. 

Table 1: A comparison of the effectiveness of the 
statistics considered, in detecting the presence of a co- 
herent step discontinuity with amplitude 2a relative to 
the standard deviation of the underlying scale invariant 
Gaussian map. 



a 


Skewness 


Kurtosis 


SMD Average 


0.00 


0.01 ±0.11 


2.97 ±0.19 


0.02 ±0.31 


0.25 


0.01 ±0.11 


2.95 ±0.20 


0.25 ±0.33 


0.50 


0.02 ± 0.11 


2.88 ± 0.21 


0.48 ±0.38 


1.00 


0.03 ±0.20 


2.82 ±0.32 


0.98 ±0.48 



The analytical prediction of section 2 for the SMD 
average value a is in good agreement with the results 
of the Monte Carlo simulations. The standard deviation 
of this result is not in such a good agreemnent with the 
analytical prediction because the assumption of complete 
independence among pixels made by the analytical treat- 
ment is not realized in the Monte Carlo simulations, here 
a scale invariant spectrum was considered and thus there 
was a non-trivial correlation among the pixels of the ar- 
rays. 

A simple way to further improve the sensitivity of the 
SMD statistical variable is to ignore a number I of bound- 
ary pixels of the SMD array, thus constructing its average 
using the Sample Mean Differences of pixels l + l, ...,n— I. 
From eq (21), the variance of the SMD for these pixels 
is significantly lower than the corresponding variance of 
the 21 pixels close to the boundaries. In addition, if the 
step is located within the central n — 21 pixels the SMD 
average may be shown to be larger than a thus further 
amplifying the step signature. For I = 150 the variance 
of the SMD average is reduced by about 20% while the 
SMD average is increased by about 20% thus allowing 
the detection of steps as low as a = 0.25 at the la level. 



The price to pay for this sensitivity improvement is the 
reduction of the effective pixel area where the search for 
steps is made. 

We have also used the SMD statistical variable for 
non-scale invariant power spectra and found that it works 
better for P(k) = k~ m with < m < 1 than for to > 1. 
This is to be expected because large values of m imply 
larger correlations among pixels which in turn leads to 
a smaller number of effectively independent pixels and 
thus a larger value for the variance of the SMD average. 

It is straightforward to generalize the one dimensional 
Monte Carlo simulations to two dimensions. In that case 
we use the two-dimensional discrete Fourier transform as 
an approximation to an expansion to spherical harmon- 
ics. This approximation is good for small area maps of 
the celestial sphere. We used the following definition of 
the two dimensional discrete Fourier transform. 

1 " 

X(i,j) = - 5 (fci,Me 2 ™ [( *- 1)(fa - 1)+(j - 1)(fe2 - 1)]/ " 

fel,fc2=l 

(31) 

refering to a n x n square lattice. In order to construct 
the background of scale invariant Gaussian fluctuations 
we used g{k\,k2) as a complex random variable. For 
scale invariance, the amplitude of g{k\, k 2 ) was obtained 
from a Gaussian probability distribution with mean 
and variance 

a 2 (k 1 ,k 2 ) = P(k 1 ,k 2 ) = 1 (32) 

The corresponding phase 9k lt k 2 f° r the (k\, k 2 ) mode 
was also determined randomly from a uniform probability 
distribution P{6f tl ,k 2 ) = ^ in order to secure Gaussian- 
ity for the map X(i,j). 

The corresponding map with a superposed coherent 
step discontinuity was obtained from the standardized 
Gaussian map X s (i,j) as 

X'(i,j)=X s (i,j) + a J .~ ai ~ b h , (33) 
\j-ai-b\ 

where 



X2 — X1 

b = yi — a x\ (35) 

i.e. the line of step discontinuity j — a i + b is deter- 
mined by the two random points (xi,yi) and (£2,2/2) of 
the map X(i,j). The skewness and kurtosis of the two 
maps are obtained in the usual way. For example for the 
standardized Gaussian map X s (i, j) we have 



» = ^E*'(M') 8 

n 

fc = ^E* s (* 



(36) 
(37) 



The SMD statistical variable is obtained by consider- 
ing a set of random straight lines bisecting the map and 
for each line taking the difference of the sample means 
from the two parts of the map. For example consider a 
line defined by the random points (xi,yi) and (£2,1/2) 
of the map. The line equation is j = a i + b with a, b 
obtained from eqs. (34) and (35). The SMD obtained 
from this line is 



SMD = 



Si 

m 



S2 
n 2 



where 



5 i = E E x'&i) 

i=l j=Max[(a i+6),l] 
n Min[(a i-\-b),n] 

^ = E E x "&i) 
»=i j=i 



(38) 

(39) 
(40) 



and ni, n,2 are the corresponding numbers of terms in 
the sums. For a Step+Gaussian map, the index s get 
replaced by '. 

The average and variance of the SMD is obtained by 
averaging over a large number of random test lines (a, b) 
and a large number of map realizations. The results of 
the application of the three statistics (skewness, kurtosis 
and SMD average) on 30 x 30 scale invariant Gaussian 
maps for various values of step amplitudes a are shown 
in Table 2. Uncorrelated Gaussian noise was also super- 
posed on the signal with signal to noise ratio 2.0. The 
random points defining the test lines were excluded from 
the outermost three rows and columns of the maps thus 
reducing somewhat the variance of the SMD average. 

Table 2: A comparison of the effectiveness of the 
statistics considered in two dimensional maps. The sig- 
nal to noise ratio was 2.0. Points defining the line dis- 
continuities were excluded from the three outermost rows 
and columns of the maps. 



a 


Skewness 


Kurtosis 


SMD Average 


0.00 


0.04±0.f3 


3.00 ±0.20 


0.01 ±0.03 


0.25 


0.02 ±0.08 


2.97±0.f3 


0.14 ±0.09 


0.50 


0.05±0.f4 


2.91 ± 0.24 


0.34 ±0.19 


f.00 


0.02 ± 0.24 


2.95 ±0.30 


0.56 ±0.31 



The results of Table 4 arc in qualitative agreement 
with those of Table 1 and with the analytical results valid 



for the one dimensional maps. Clearly the details of the 
one dimensional analysis are not valid in the two dimen- 
sional case and so the agreement can not be quantitative. 
The results still indicate however that the SMD statistic 
is significantly more sensitive compared to conventional 
statistics for the detection of coherent discontinuities on 
CMB maps. This statistic can detect coherent disconti- 
nuities with minimum amplitude a m i n — 0.5 at the la 
to 2cr level where a is the amplitude relative to the stan- 
dard deviation of the underlying scale invariant Gaussian 
map. 



IV. MAXIMUM SAMPLE DIFFERENCE 

An alternative statistic that can be significantly more 
sensitive than the SMD in certain cases is the Maximum 
Sample Difference (MSD). For an one dimensional set of 
data the MSD statistical variable Max{rk) is defined as 



Max(r k ) = Max( 



a(Y k ) 



) 



(41) 



where Y k is given by equations (15), (16) and a(Y k )) is 
the standard deviation of Y k given by 



a(Y k ) = 



1 



n — k 



(42) 



The variable r k has variance unity and mean 

(n - i )/(n - k) 



/j,(a, n, k, io) = 2a- 



c(fc) 



/j,(a, n, k, i ) = 2a 



ip/k 



1 < k < i (43) 
i < k < n - 1 (44) 
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Figure 2: he sample division described by k and the 
coherent discontinuity location described by io divide the 
2d pixel-map into four parts with corresponding number 
of pixels Ni, i = 1, ...,4. 



For two dimensional datasets the index k labels par- 
titions by which the two dimensional pixel-surface is di- 
vided in two parts. In the Monte-Carlo simulations stud- 
ied here we have considered only map divisions repre- 
sented by straight lines. It is straightforward however 
to generalize this to other types of divisions. In the 2d 
case Y k is generalized to the expression given in equation 
(38). Let the set of parameters io describe the location 
and shape of the coherent discotinuity in the 2d map (in 
the simplest case of a straight line discontinuity ig repre- 
sents only two numbers). Let also the set of parameters 
k describe the location of the sample division of the 2d 
map. With the sample division described by k and the 
coherent discontinuity location described by io the 2d 
pixel-map is divided into four parts with corresponding 
number of pixels Ni, i — 1, ...,4 (Figure 2). For simplic- 
ity hereafter we will omit the bar in io and k thus using 
the same notation as in the Id case. 



Thus 



Si 



^ +N 4 N 2 + N 3 



<r(Y k ) = 



1 



1 



with 



X +N 4 N 2 + N 3 



Yk , _ w k (io) 

= u k + 2a- 



a(Y k ) 



<r(Y k ) 



(45) 
(46) 

(47) 



where u k = 
variable and 



-tS^t is a standardized Gaussian random 



Wfe(loH 2 ( ivTT^4"7v^V3' 



(48) 



Define now a e ff = aw k (io). Clearly when the partition 
k coincides with the discontinuity io (N4 and N 2 — > 
0) we have a e // — > a. Otherwise |a e //| < |a|. The 
statistical variable r k is Gaussian with variance unity and 
mean 



< r k >-- 



2a 



eff 



< 



2a 



(49) 



a(Y k ) ~ a(Y k ) 

The Max(\r k \) after n trials (partitions) is therefore a 
sensitive function of a e ff (in the limit where we take 
all possible partitions we will also have a partition with 

a eff -» a). 

Now assume that after n trials (partitions) we found 
Max(\r k \) = Vo > 0. Since the variable u k of equation 
(53) is standardized Gaussian the probability p>(Vo) at 
each trial that we obtain a value Vq or larger for \r k \ is 



P>(Vo,a eff ) 



1 r°° 
J\v \ 



dr k e 



-|Vo| 



dr k e~ 



(50) 



Using the binomial distribution we find the probability 
for x values of rk above Vq after n partitions to be 



P x (n,V ,a ef f) 



x\{n — x)\ 



P>(V ,a eff ) x (l -p > (V ,a eff )) 



n-l 



(51) 



In our case we have only one occurence of Vo (since it 
is maximum) and the probability for this to happen is 
Pi(n, Vo, ct e ff). Thus, from a 2d pixel-map we can mea- 
sure Vo (the maximum of r k , n (the number of divisions 
used in the test) and cr(Yk ) (for the partition k that 
corresponds to Vo)). With this input we obtain the prob- 
ability distribution P\{a e ff) given n, Vo and o~(Yk ). For 
example assume that we measured Vo with 100 divisions 
(n = 100) in a 30 x 30 pixel map. A reasonable value of 
cr(Yfe ) (to be obtained exactly from the data) is 



1 



Ni + N 4 N 2 + N 3 



2 



0.07 (52) 



Given n, Vo and a(Y ko ), the probability distribution 
Pi(a e ff) is an even function of a e ff, completely deter- 
mined and has maxima P^a^F) at ±a^ a f- For larger 
Vq we expect larger a™^f . 

For example, it may be easily verified using equation 
(57) and the package M athematica that with n = 100 we 



have la' 



eff 



for V < 



\rcrit 
V 



2.5. In general, given 



n, Vq and a(Y k ~ ), we can determine the probability dis- 
tribution for a e f f and therefore the most probable value 
of a e ff (a lower bound on a) can be found. This most 
probable value is a^ff and the corresponding probability 
is Pi(a^y : )- We also find the probability that there is no 
coherent discontinuity on the map as Pi(a e // = 0) (for 
Vo < V crtt it is most probable that there is no coherent 
discontinuity on the map). 
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Figure 3: plot of Pi(a e //) for Vo = 2.6 (continous 
line), Vq = 3.0 (dotted line) and Vq = 4.5 (dashed line). 



In Figure 3 we show a plot of Pi(a e //) for Vo = 2.6 
(continous line), Vq = 3.0 (dotted line) and Vq = 4.5 
(dashed line). Clearly for Vo < 2.6 it is most proba- 
ble that there is no coherent discontinuity on the map 
(a™ f a f x = 0) with Pi(a eff = 0) = 0.33. On the other 
hand for Vo = 3.0, the most probable value of a e ff is 
\a™ f f \ = 0.04 with probability Pi{a™ f J) ~ 0.37 while 
the probability that there is no coherent discontinuity on 
the map is P\{a e ff = 0) ~ 0.16. 

It is important to verify the above analytical results 
using Monte Carlo simulations of 2d data. We consid- 
ered 2d 30 x 30 data-sets as described in section 3 with 
uncorrelated standardized Gaussian data (white noise). 
On these we superpose a coherent discontinuity with am- 
plitude 2a with a in the range [0,0.45]. For each a we 
construct 10 maps and find Vq and its standard devi- 
ation er(Vb). We also find cr(y fe - ) which was practically 
constant at o~(Y^ g ) ~ 0.07 as predicted analytically. With 
the input Vq and cr(Yfc ) we construct Pi(a e //) and find 
|ajnas|, p 1 ( a ™° a: ) an( j P 1 (0). These results are shown in 
Table 5. 

Table 5: he effectiveness of the MSD statistic con- 
sidered in two dimensional maps with a flat spectrum of 
fluctuations (white noise). First column shows the mag- 
nitude of the coherent discontinuity superposed on the 
standardized Gaussian map and the fourth column shows 
shows the derived most probable value value of a e f / (the 
lower bound of a) based on the MSD statistic. 



a 


Vo 






Pi(0) 


Pi(\< f r\) 


0.00 


2.6 ±0.5 


0.07 


0.0 ±0.01 


0.36 


0.36 


0.03 


2.8 ±0.6 


0.07 


0.02 ±0.02 


0.31 


0.37 


0.06 


3.4 ±0.7 


0.07 


0.04 ±0.02 


0.06 


0.37 


0.1 


4.1 ±0.7 


0.07 


0.06 ±0.03 


0.005 


0.37 


0.2 


5.6 ±0.7 


0.07 


0.11 ±0.03 


0.0 


0.37 


0.25 


7.5 ±0.7 


0.07 


0.18 ±0.03 


0.0 


0.37 


0.3 


9.0 ±0.9 


0.07 


0.24 ±0.03 


0.0 


0.37 


0.35 


9.8 ±0.8 


0.07 


0.27 ±0.03 


0.0 


0.37 


0.4 


10.4 ± 1.1 


0.07 


0.29 ±0.03 


0.0 


0.37 


0.45 


12.4 ±1.6 


0.07 


0.36 ±0.04 


0.0 


0.37 



Comparing Oi e ff with a we confirm that in all sim- 



ulated cases la 



eff 



is a lower bound on lal. It is also 



clear from Table 5 that the MSD method can detect the 



presence of a coherent discontinuity with a 
probability ratio 



P 1 (a eff =0M) 
Pi(a ef f=0) 



7 



0.04 with 



(53) 



The MSD statistic is significantly more sensitive in de- 
tecting coherent discontinuities compared to the SMD 
statistic of sections 2, 3. We expect however that the 
sensitivity of this statistic will be significantly reduced 
when correlations are introduced in the data. We have 
shown using additional Monte-Carlo simulations that this 



statistic is not as robust as the SMD statistic. In partic- 
ular when we include correlations in the data (e.g. scale 
invariance), the sensitivity of the MSD statistic drops 
rapidly to the level of the SMD statistic i.e. it can de- 
tect a coherent discontinuity with a ~ 0.4. This implies 
that the MSD statistic is more useful in detecting the 
presence of coherent discontinuities only when applied to 
noise dominated data. 



V. CONCLUDING REMARKS 

An important issue that needs to be clarified is the 
following: 'What are the effects of other strings giving rise 
to their own step discontinuity? Do they decrease the 
sensitivity of the suggested statistical tests?' 

No attempt is made in this paper to model the fluc- 
tuations of 'other strings'. Any such attempt (even those 
of simulations) is faced with the possibility of serious er- 
rors due to incorrect assumptions. Even basic features of 
the string scaling solution are still under serious debate. 
For example there have been serious claims recently |2l| l 
that realistic field theoretical cosmological simulations of 
gauged string evolution would have no wiggles for long 
strings and no loop component. In addition, the physical 
processes affecting the CMB photons are not well known 
especially in defect based models. The issues of reioniza- 
tion, fluctuations present on the last scattering surface, 
wiggles of long strings and other effects have only been 
crudely modeled so far. 

Instead of attempting a rough modeling of these ef- 
fects we have made a very robust and reasonable assump- 
tion: The statistics of CMB fluctuations induced by a 
string network on large angular scales are either gaus- 
sian (as was the common belief so far) or minimally non- 
gaussian in the sense that the only non-gaussianity is due 
to a late long string. Additional types of non-gaussianity 
are not excluded but they would simply make the de- 
tection of non-gaussianity easier by using the proposed 
(or other) tests. In that sense the proposed tests would 
only be able to find a lower bound on Gfj, which however 
turns out to be cosmologically quite interesting given the 
optimum sensitivity of the tests for detecting the above 
defined 'minimal non-gaussianity'. 

The question that has been addressed in this pa- 
per is the following: Given the presently known values 
for 43- from COBE on large angular scales, what is 

1 rms 

the minimum value of G\i detectable under the above 
stated assumption of 'minimal non-gaussianity' ? Using 
the SMD or MSD statistics which are optimized to de- 
tect coherent temperature discontinuities on top of Gaus- 
sian temperature maps we may obtain non-trivial upper 
or even lower bounds on the values of G[iv s j s which are 



highly robust and independent of the details of the string 
evolution and the resolution of the CMB maps. Applica- 
tion of these statistics on the COBE data is currently in 
progress JL9|. 
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